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INTRODUCTION 


The present document aims to report, from the perspective of a Forensic Science 
consolidated method based on a statistical analysis observational process, the 2022 First 
round Brazilian Election counting data. The observational method used is Benford's Law. 


The observated data used is available on the Brazilian Superior Electoral Court's 
(Tribunal Superior Eleitoral - TSE). 


NEWCOMB-BENFORD’S LAW 


Benford's Law, also called the law of the first digit, or Newcomb-Benford's Law, and the 
law of anomalous numbers, refers to the distribution of digits in various sources of real 
cases. Instead of expected homogeneity, the law states in many naturally occurring 
numbers collections the first significant digit is likely to be small. Without homogeneity, 
this distribution shows digit 1 has probability of appearing 30% in a statistical data set, 
while larger values are less likely to appear. 


Frank Benford has shown this result applies to a wide variety of data sets, including 
electricity bills, addresses, stock prices, private equity prices, population numbers, death 
rates, river lengths, physical and mathematical constants, by power laws (which are very 
common in nature). All these statements are calculated or defined on a logarithmic 
scale. 


NEWCOMB-BENFORD’S LAW APPLICATIONS 


- Judicial Evidence 

- Electoral Data Analysis 

- Macroeconomic Data 

- Tax Fraud Analysis of 

- Genome Data Analysis 

- Scientific Fraud Detection 


Following Benford's Law, or Looking Out for No. 1 


By MALCOLM W. BROWNE AUC. 4, 1998 


DR. THEODORE P. HILL asks his mathematics students at the Georgia 
Institute of Technology to go home and either flip a coin 200 times and 
record the results, or merely pretend to flip a coin and fake 200 results. The 
following day he runs his eye over the homework data, and to the students’ 
amazement, he easily fingers nearly all those who faked their tosses. 


"The truth is,” he said in an interview, “most people don't know the real 
odds of such an exercise, so they can't fake data convincingly.” 


There is more to this than a classroom trick. 


Dr. Hill is one of a growing number of statisticians, accountants and 
mathematicians who are convinced that an astonishing mathematical 
theorem known as Benford's Law is a powerful and relatively simple tool for 
pointing suspicion at frauds, embezzlers, tax evaders, sloppy accountants 
and even computer bugs. 


(Benford’s Law in New York Times) 


Benford’s Law in Brazil 


Written on Auditors Brazilian Court (TCU) website: 


“Several studies have been conducted adopting the hypothesis that fabricated 
data is identified by digits deviation with respect to Benford’s distribution.” 


“Walter Mebane, an American statistician at the University of Michigan, has 
studied election data from several countries, including the United States, Russia, 
and Mexico.” 


"The researcher analyzed the data from the Iranian elections in 2009 and found 
anomalies that strongly indicated the occurrence of fraud in the victory of 
politician Ahmadinejad (Mebane, 2009).” 


[https://portal.tcu.gov.br/imprensa/noticias/aplicacoes-da-lei-de-benford-a-auditoria-de-obras-publicas.htm] 


InovaTCU » Noticias 


Aplicagdes da Lei de Benford a auditoria de obras publicas 


As andalises de pregos nas auditorias de obras pUblicas por vezes ocupam semanas de trabalho do auditor, 
pois, em muitos casos, as planilhas orgamentarias so extensas e de dificil analise. A Lei Newcomb-Benford é 
uma ferramenta de minerag&o de dados, alternativa a Curva ABC, que permite uma selegdo possivelmente 
mais precisa dos servigos das planilhas para andlise de prego. 


/inovatcu/noticias/aplicacoes-da-lei-de-benford-a-auditoria-de-obras-publicas.htm 


Walter Mebane, um estatistico americano da Universidade de Michigan, estudou dados eleitorais de varios 
paises, incluindo os Estados Unidos, Russia e México. Em 2006, ele descobriu que a contagem dos votes 
tendia a seguir a Lei de Benford no segundo digito (Mebane, 2006). O pesquisador analisou os dados das 
eleigdes iranianas em 2009 e encontrou anomalias que indicavam fortemente a ocorréncia de fraude na vitdria 
do politico Anmadinejad (Mebane, 2009). Mebane verificou que, nas cidades com poucos votos invalidos, os 
numeros de Anmadinejad passavam longe da distribuig&o de Benford e que o candidato, nessas situagées, 
possuia uma grande vantagem nos votos. 


Considering the highest national auditing body recognizes and propagates the use of 
Benford Logic in public audits, including at the electoral level, we began to analyze the 


2022 Brazilian presidential election first round data. This technique allows us to make 
very quick conclusions about the numerical sets's consistency. 


NEWCOMB-BENFORD’S LAW IN BRAZILIAN ELECTIONS 


The 2022 Brazilian presidential election first round data analysis revealed statistical sets 
inconsistencies. This fact does not assert itselfs there was intervention or data external 
tampering. This can only be affirmed after meticulous and detailed investigation from 
the standpoint of defending the National Sovereignty interests. There was undoubted 
proof of external access to the electoral operation system in recent past elections. The 
subject was headlined in several media and the electoral system operator itself 
recognized this fact, including the forwarding of documents to police authorities. But the 
2022 numerical indications - regardless of this fact and before considering whether there 
were external vulnerabilities in the last election - corroborate the need for protective 
measures as a precautionary measure. See the electoral zone grouping charts below. 


As an example, this is a panoramic view of data analysis grouped by electoral zones. It is 
possible to verify there are many numerical sets, in different readings and formats, that 


present apparent non-conformities with the NBL parameters, which will be the object of 
specific deeper analysis. 


Pure data was used for the analysis, obtained from the TSE super | Electoral Court) 
data repository. [https: i i i i 


The data was properly decompressed and inserted in appropriate databases - in parallel 
and redundant checking operations - in order to preserve the originality of the data in 
the imported tables. The main analysis was restricted and concentrated on the position 
for president, reaching the candidates with the 4 highest votes, and also comparing the 
"White" and "Null" votes. 


The application of the 1st digit rule for NBL requires numbers that meet all numeric 
series that reach the digits 1 to 9, in the case of Electoral Zones and Cities. Both serve 
the purpose of leveraging the NBL (Newcomb-Benford Law) rule. However, the study 
done by cities, if performed with a very restricted universe, may generate distortions. 


The population distance between the largest and the smallest electoral zone in Brazil has 
a smaller interval than the population distance between the smallest and the largest 
Brazilian municipality. 


The first study part uses the 1st digit NBL rule, in which the 2,637 Electoral Zones in 
Brazil and abroad were evaluated. The presence of distortion is noted, where the 
average deviation exceeds the possible margins of error. 


As an initial planimetric overview, some graphs are presented in sequence. These are the 
data will be further study object (1): 
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(1) This table contains the UMIs - Minimal Informational Units - used for the present study. It will be 
repeated several times during the text. It is generated in Portuguese, because it reflects images captured 
directly from the primary data processing. Its columns are as follows, with their respective meanings in 
English: Digit/Quantity Votes / NBL1st / %1st Digit/ NBL Pattern / Variation / NBL by Zones / Total 
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The cases above (RJ, CE and PE States) were demarcated because they showed a high 
incidence of "zero" in cells where there should be a positive value. They are here placed 
as detected anomalies at the State level examples. However, their isolated study cannot 
be dissociated from the global study with aggregated data. 
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CANDIDATO. 
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LE! DE BENFORD APLICADA NO 1° DIGITO POR ZONAS ELEITORAIS PARA PRESIDENTE - POR UF 
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CANDIDATO JAIR BOLSONARO 
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Similarly to the isolated state-level data from DF, AC, BA, and CE, states also show 
anomaly identification in the detected groups records. 


Considering the short time between the expedition of the definitive results and the 
analysis, it is evident the need to unfold these referenced foci as a particularized 
observation point. 


If it were the case of a specific document audit [the primary purpose of Benford's Law, 
i.e., to select indicative focuses and to conduct a document audit], these would be the 
first round of specific document audits potential focuses. This is not possible in the 
current Brazilian electoral model, considering the inexistence of a documental collection 
that could elide the doubts pointed out by Benford's examination in these foci. 


We present below all the data tables referring to the votes obtained by the 4 most voted 
candidates, with the tabulations referring to blank and spoiled votes, in the following 
order: 


a) National Quantitative; 


b) Regional Quantitative. 


A total of 36 graphical data views regarding the LNB analysis in the ist digit are 
presented below. 


First, the national data aggregated by the first digit: 
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CANDIDATO SIMONE TEBET [7] 
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LEI DE BENFORD APLICADA NO 12 DIGITO POR ZONAS ELEITORAIS PARA PRESIDENTE - BRASIL E EXTERIOR 


CANDIDATO CIROGOMES [7] 
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LEI DE BENFORD APLICADA NO 1° DIGITO POR ZONAS ELEITORAIS PARA PRESIDENTE - REGIAO NORDESTE 
CANDIDATO JAIR BOLSONARO(T, 
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CANDIDATO Branco 
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LEI DE BENFORD APLICADA NO 1° DIGITO POR ZONAS ELEITORAIS PARA PRESIDENTE - REGIAO NORDESTE 
CANDIDATO GROGOMES —_[-r 
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LEI DE BENFORD APLICADA NO 1° DIGITO POR ZONAS ELEITORAIS PARA PRESIDENTE - REGIAO NORTE 
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LEI DE BENFORD APLICADA NO 1° DIGITO POR ZONAS ELEITORAIS PARA PRESIDENTE - REGIAO SUL 
CANDIDATO GROGOMES [7] 
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LEI DE BENFORD APLICADA NO 1° DIGITO POR ZONAS ELEITORAIS PARA PRESIDENTE - REGIAO SUL 
CANDIDATO SIMONE TEBET (7) 
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LE! DE BENFORD APLICADA NO 1° DIGITO POR ZONAS ELEITORAIS PARA PRESIDENTE - REGIAO SUDESTE 
CANDIDATO Nulo[ 
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CANDIDATO CRO GOMES [77] 
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2ND DIGIT RULE 


Another Benford Law technique used, works with the 2nd digit. The renowned 
researcher Walter Mebane, from University of Michigan, used this technique to evaluate 
the 2009 Iranian elections. Strong anomalies were found that indicated the victory of 
politician Ahmadinejad. In this series, the data were worked by electoral zones, 
considering the votes that reached more than 10 votes, in order to benefit only the 2nd 
digit. 


The historical proportions for the second digit are as follows: 
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In the graphs below, it is possible to see the apparent distortions occurring in the totals 
aggregated by Electoral Zones of the Brazilian states, later grouped by geographic region. 
As shown in the graphs, a data aggregate amount which analysis focuses on the second 


digit can present apparent distortions. These distortions should be the object of specific 
studies, which will be seen in subsequent data series. 


The following is a group of examples in which there are regions by the second digit 
clustering. There are also evidence of anomalies in data distribution: 
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These 4 graphs present - all of them - show an average variation index higher than 10%. 


Regarding the last digit analysis, we have a straight line, with the inversion of the 
parameters, instead of a descending curve with the ideal values of Benford 's Law. There 
is a lot of data and graphs of the entire electoral universal data from all over the country. 
As an example and indicative element, we bring some graphs described below: 
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In these three numerical groups presented, there is a very interesting phenomenon from 
an analytical point of view. Where there should be a line, there is a curve. 


There is another case with similar (but reversed) behavior derived from second digit 
data: 
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In this example, the inverse occurs. Where there should be a "descending curve", there 
is an "almost straight", or a slightly ascending curve. This type of situation (curves where 
there should be straights, or straights where there should be curves, or rising curves 
versus falling curves) materializes the highest level concern sign about the integrity of 
the data sets. This constitutes a strong indicative element. 


This kind of inversion is exactly what happened in the controversial Venezuelan election 
of 2004. The inversion of axes occurred, which may mean that some of the number sets 
are not products of natural distribution, but of random number generators. The 
hypothesis is explained in the video documentary "El Poder de la Matematica", available 
on Youtube. [https://www,voutube,com/watch ?v=IZNiFCvibP8&t=303s] 


The video is described as follows : “"May 25, 2013 Documentary showing the Enron case and the 
2004 Presidential Recall Referendum in Venezuela, in which the use of mathematics, specifically 
Benford's Law, was used to detect fraud". Here are some screenshots taken directly from this 
documentary: 
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In the 7 min and 44 seconds of the video is presented the situation in which the 
inversion between straight lines/curves occurs (highlighted in yellow). It is observed that 
among the 4 numerical groups graphs where there should be a behavior adhering to 
Benford's patterns, this occurs only in 3 groups. One of them does not follow the 
Benford patterns, but rather the patterns of equitable digits distribution. This tends to 
be a pattern anomaly by NBL view. 


This behavior [inversion between curves and straight lines] was identified in at least 4 
situations in the 2022 First Round Brazilian presidential elections. 


CONCLUSION 


i) The sixth column of each table contains the percentage variation that occurred 
between the parameter considered "ideal" for the NBL and the actual data found. In 
several cells this variation had a significant amount. 


ii) Obtaining the average variation of each table we have the following scenario; 


ii.a) 30 times - in 30 scenarios - this difference exceeded the 10% variation mark 
outside the ideal NBL parameter. 


ii.b) In 18 occasions this average discrepancy exceeds 20%; 


ii.c) In 7 occasions it exceeds the average of 30%, 1 of them exceeds 40%, and 
another exceeds 50%. 


(on un oun 


iii) The comparative of items “i”, “ii” e “iii” considers 30 tables (5 regions composed by 
the 4 most voted candidates, plus blank and null votes) referring to the 1st digit analysis; 


iv) There are other types of variations that do not claim to the standards considered 
adequate, considering the second and last digits analysis, as well as there are specific 
situations of more intense aggravation (such as the cases of "zero" cells in the states 
mentioned). However, these data are localized and need to be examined in the context 
of the national data set. 

v) Records found in the data sets generate the inversion between "descending curves", 
"straight", "ascending curves", which also occurred in the 2004 Venezuela case cited in 
the body of the text. 


Considering the analyzed data as well as the evidential profile of Benford's Law (and not 
probative), the most correct procedure to be adopted in sequence should be a deep and 
detailed audit, with analysis and comparison of PHYSICAL DOCUMENTS against the 
graphic evidence identified, in order to be able to elide or confirm the detected 
anomalies. 


Ex positis, and considering the speed and scarcity of time, it is concluded that [as in the 
past] there is a potential for risk of undue penetration and the realization of artificial 
data tampering that justifies the discrepancies found - without this being assertion 
constitutes, at the present time, a definitive evidence. This fact constitutes an 
evidentiary element and may give rise to the adoption of EMERGENT and URGENT 
measures regarding the preservation of NATIONAL SOVEREIGNTY of the Brazilian Nation 
in the face of possible international interests. It is necessary to guarantee the complete 
independence of the results for the second round, in the face of any international threat, 
as recorded. Such measures consist of joint and cooperative action of federative bodies, 
each within its own area of operation. 


Obs: This document is a work in progress and will be expanded in the coming days. The 
accelerated way in which the study was produced and reported may lead to eventual 
spelling and writing inaccuracies, and it is under review immediately after its conclusion. 


